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'do' 

yn^ ■ Abstract. We consider the Monge-Kantorovich transport problem in an ab- 

^•—^ ' stract measure theoretic setting. Our main result states that duality holds 

' ^ ' if c : X X Y — > [0, oo) is an arbitrary Borel measurable cost function on the 

product of Polish spaces X, Y. In the course of the proof we show how to 
relate a non - optimal transport plan to the optimal transport costs via a 
"subsidy" function and how to identify the dual optimizer. We also provide 
(J"\ ' some examples showing the limitations of the duality relations. 
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1. Introduction 



q 

'~i I We consider the Monge-Kantorovich transport problem for Borel probability mea- 

a • sures /x, i/ on Polish spaces X, Y. See }RR98[ IVil03[ IVil05| for a general account 

of the theory of optimal transportation. The cost function c : X x y ^ [0, oo] is 
assumed to be Borel measurable. n(/i, v) is the set of all transport plans, that is, 
Borel probability measures on X x F which have X-marginal /z and F-marginal v. 
^ I The transport costs associated to a transport plan tt are given by 

OO 

VO '. (1) ^cM = / cdiT 

JxxY 

and we say that tt is a finite transport plan if /c[7r] < oo. The infimum over 
r~^ . all possible transport costs /c[7r],7r G n(/i, j/) will be denoted by Ic. We define 

^^ ' $(/i,i/) as the set of pairs {4>,il^) of integrable functions : X — > [—00,00) and 

00 . ip : Y -^ [—00, 00) which satisfy (j){x) + 4'{y) < c{x, y) for all [x, y) G X x y. The 

dual part of the Monge-Kantorovich problem then consists in maximizing 



(2) J{(l),^)= / (t)d^i+ ipdv 

Jx Jy 

for {(j>,ip) € <&(//, i^). Monge-Kantorovich duality asserts that inf{/c[7r] : tt £ 
II{n,v)} = sup{J((/), -0) : ('/'j'0) £ '^'(a*!^)}- Foi' example, if X and Y consist 
of N points, each having measure 1/X, duality holds true, as this reduces to an 
elementary linear programming problem. In the literature duality has been estab- 
lished under various conditions, see for instance [Vil05|, p 98f] for a short overview. 
In particular it is known that duality holds if c is lower semi-continuous (see [Kel84j 
Theorem 2.2] or |Vil05| Theorem 5.10]) or just Borel measurable and bounded by 
the sum of two integrable functions (; [Kel84[ Theorem 2.14]). In |RR95[ IRR96] the 
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problem is investigated beyond the realm of Polish spaces and it is characterized 
for which spaces duality holds for all bounded measurable cost functions. 

Our main result is that Monge-Kantorovich duality holds in the case of a finite 
but not necessarily bounded Borel measurable cost function. 

Theorem 1. Assume that X, Y are Polish spaces equipped with Borel probability 
measures fi, v, that c : X x Y ~* [0,cxd] is Borel measurable and fi <S> v-a.e. finite 
and that there exists a finite transport plan. Then 

(3) h = mfj/cM : n G n(/i, j/)} = sup{ J(0, iP) : (0, ^) G <^{p, ly)}. 

In contrast to the situation where c is lower semi-continuous, duality does not 
hold in general in the Borel setting if c attains the value oo on a large set, see 
Example 14.11 



1.1. Existence of dual maximizers. In general it is not possible to find dual 
maximizers (j) and ■0 for ([2]) which are integrable, even if the cost function is the 
squared distance on R (see Example 14.51 below). However it is possible to find 
dual maximizers in a weaker sense for all Borel measurable cost functions which 
are /u i8) I'-a.e. finite. Assume that tt is a finite transport plan and that 0, f/; are 
arbitrary functions satisfying (t>{x) + ipiv) 1^ c{x,y). While (l){x) + ipiy) is not 
necessarily integrable with respect to ^ (8" J^, it can be integrated with respect to -k 
(possibly assuming the value — oo). Thus we may well define 

(4) J[cP,^)^ I [cP{x)+ij{y)\dn[x,y). 

Jxky 

The subsequent lemma shows that the notation J((/), V') is justified in the sense that 
this definition does not depend on the particular choice of tt. 

Lemma 1.1. Let X,Y be Polish spaces equipped with Borel probability measures 
/I, v and let c : X X Y ^ [0, oo] be a Borel measurable cost function. Let tt, tt G 
n(/i, z^) be finite transport plans and assume that (f> : X ^ [—00,00),^ : Y — > 
[—00,00) are such that 4>{x) + tpiv) 1^ c{x,y) holds n-almost surely as well as tt- 
almost surely. Then 

(5) / [4>{x) + ^{y)] dTT{x, y)^ I [(/.(x) + ^{y)] dn{x, y). 

JxxY JxxY 

Moreover there is a sequence {4'm'4'n)n>i of bounded functions in $(/i, j/) such that 
lim„^oo J(0n,i/'n) = JxxY i^i^) + '/'(z/)] dTi{x,y). 

Using Lemma 11.11 Theorem [1] follows from Theorem [5] below. Theorem [5] is 
stronger in the sense that it guarantees that the supremum on the right side of ([3]) 
is in fact a maximum if J(0, ip) is defined as in Q. 

Theorem 2. Assume that X,Y are Polish spaces equipped with Borel probability 
measures /i, v, that c : X x Y ~f [0, 00] is Borel measurable and /i ® v-a.e. finite 
and that there exists a finite transport plan. Then there exist Borel measurable dual 
maximizers <^, "0; ^-G- functions (j) : X ^ [— 00,00),?/' : Y — > [—00,00) satisfying 
4>{x) + ip{y) < c{x, y) for all {x^y) ^ X xY such that 

(6) /, = inf{/c[7r] : tt G n(/i,j/)} = J{^A)- 
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We point out that the hypothesis that c is ^ z^-a.e. finite is crucial for the 
existence of dual optimizers (see Example 14. 3p . Even in the case of a continuous 
cost function c : XxY ^> [0, oo] , it is in general not possible to find dual niaximizers 
although duality holds in this case (see Example 14. 2p . 

In |AP03l Theorem 3.2] it is proved that in the case of a finite lower semi- 
continuous cost function there exist integrable dual niaximizers (0,V') G $(/i,i^), 
provided that 

>0. 



(7) ni^^x eX : c{x,y)diy{y) < oo}) 

(8) i^({y eY : c{x, y) dn{x) < oo}) 



Their argument yields that functions (/>, "0 which are dual maximizers in the sense 
of Theorem [2] are /i- resp. z/- integrable whenever ([7]) and ([8]) are satisfied. 

1.2. Dual maximizers and strong c-cyclical monotonicity. Theorem[3]is also 
connected with the notion of strong c-cyclical monotonicity introduced in fSTOS' . A 
transport plan tt is strongly c- cyclically monotonqj if there exist Borel measurable 

functions : X — > [— oo, oo), f/; : y — > [— oo, oo) such that 4'{x) + ipiy) < c{x, y) for 
all {x,y) e X X y and (j>{x) + ip{y) = c{x,y) for 7r-almost all {x,y) ^ X x Y. In 
[BGMS 08] it is proved that in the case of a finite cost function, a transport plan 
is strongly c-cyclically monotone if and only if it is optimal. We want to point out 
that this is also a consequence of Theorem [2j 

Corollary 1.2. Assume that X, Y are Polish spaces equipped with Borel probability 
measures fi, v, that c : X xY ^ [0,oo] is Borel measurable and jjl ® v-a.e. finite 
and that there exists a finite transport plan. 

(1) Let TT be a finite transport plan and assume that </) : X — > [— oo, oo), tjj : Y ^ 
[—00,00) witness that tt is strongly c-cyclically monotone. Then J{(f),ip) = 
IcIt^], thus n is an optimal transport plan and (^,1/' are dual maximizers. 

(2) Assume that it is an optimal transport plan, i.e. Ic[Tt] — Ic- Then -k is 
strongly c-cyclically monotone. In fact, this is witnessed by every pair (0, ^) 
of dual maximizers. 

Proof. Given a transport plan tt and functions 0, V' witnessing that tt is strongly 
c-cyclically monotone, we have 

(9) Ic[tt] = / c{x, y) dTT{x, y)= [(/.(x) + ^{y)] dn{x, y) = J((/., ^) < h. 

JXy.Y JXY.Y 

Thus 1c\t^\ — Ic, hence tt is an optimal transport and (j),Tp are dual maximizers in 
the sense of Theorem [2] 

Conversely assume that tt is an optimal transport plan and that cf), tp are dual 
maximizers. Then 

(10) = h[TT] - J(0, V3) = / c{x, y) ~ [0(x) + ^(y)] dTT{x, y). 

JXy.Y 



This notion is called strong c-monotonicity in IST08I iBGMSOSl . We find it, however, more 
consistent with previous notations in the literature to call it strong c-cyclical monotonicity. 



4 MATHIAS BEIGLBOCK AND WALTER SCHACHERMAYER 

Hence c{x,y) — [4>{x) + ^{y)\ = for 7r-almost all {x,y) ^ X x Y . Thus, cjijip 
witness that vr is strongly c-cyclically monotone. D 

1.3. Continuity of I^- The fact that Monge-Kantorovich duality holds for not 
necessarily finite lower semi-continuous cost functions is related to a certain con- 
tinuity property of the mapping c i-^ /c which is always satisfied in the lower 
semi-continuous setting. 

If the cost function c : X x F — > [0, oo] is lower semi-continuous, there exists 
a sequence (c„)n>i of bounded continuous functions such that c„ t c. For each 
such sequence we have Ic ] Ic- This can easily be derived from the fact that 
n(//, v) is weakly compaco Pick for each n > 1 a transport plan 7r„ such that 
/c„ [tTk] < /c„ -l- 1/n. By passing to a subsequence if necessary we may assume that 
(7rn)n>i converges weakly to some transport plan tt G n(/i, v). Then 

(11) Ic < Ic[t^] = lim / Cmdn = lim lim / c„idTT, 



(12) < lim lim / Cnd-Kn = lim /c„. 

m — >oo \ n — >oo J J n — *oo 

Since /c„ < Ic it follows in fact that /c„ t Ic- Observe that /c[7r] — Ic, i-e. tt is a 
primal optimizer. 

A direct consequence of this simple continuity result is that as soon as we have 
shown the relatively easy result that duality holds for bounded continuous functions, 
it already follows for an arbitrary lower semi-continuous function c : X xY ^ [0, oo] . 
To see this, pick a sequence of bounded continuous functions Cn : X x Y ^ [0, oo) 
and for each n > 1 a pair of integrable functions (</)„, V"™) such that J ((/)„, ipn) > 
/c„ - 1/n and (/>„(a;) + V^„(y) < c„(x, y) ( < c{x, y)) on X xY. Then 

(13) snp{J{(f>n,'tpn}> lim (/c„ - 1/n) = /c, 

ri>l n^oc 

thus duality holds. 

A similar continuity property holds in the case of a finite measurable cost func- 
tion. However we do not know how continuity in this sense can be shown directly, 
instead we achieve it as a consequence of Theorem [TJ 

Corollary 1.3. Assume that X, Y are Polish spaces equipped with Borel probability 
measures ii, v, that c : X x Y ^ [0,oo] is Borel measurable and fi (8) u-a.e. finite 
and that there exists a finite transport plan. Then IcAn T Ic ■ 

Proof. Given e > there exist bounded functions (p : X ^ M., t/j : Y ^ M. such that 
J{(j), ip) > /c — e by Theorem [1] and Lemma 11.11 For all large enough n, we have 

(t>{x) +'ip{y) <{cA n) {x, y) on XxY. Thus Ic„ > J((/), 'ip)>Ic-s. D 

1.4. Cost functions with negative values. For notational convenience we have 
chosen to consider only non- negative cost functions, but this restriction is somewhat 
stronger than necessary. Theorem [1] does remain valid (and in fact so do our other 
results) in the setting of a cost function c : X x Y —^ [— oo, oo], provided that c is 
fi(^ v-a.e. finite and that there exist integrable functions a : X —> [— oo, oo), b : Y —> 
[—00,00) such that 

(14) aix)+b{y)<cix,y) 



This is a consequence of Prokhorov's Theorem, see |ViI05l p 56]. 
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for all {x,y) Cz X X Y . This is an immediate consequence of Theorem [1] applied to 
the cost function c(x, y) — a{x) — 6(2/ ]j- 

2. C-CYCLICAL MONOTONICITY 

A transport plan tt is c-cyclically monotone if it is concentrated on a Borel set 
r C X X Y which is c-cyclically monotone in the sense that 

n 

(15) ^c{xi+i,yi)-c{xi,yi)>0. 

1=1 

for all {xi,yi), . . . ,(xn,yn) G T. (Here we let Xn+i — xi.) Heuristically, n is c- 
cyclically monotone if it cannot be enhanced by means of cyclical rerouting. Hence 
it is intuitively appealing (and obvious in the finite setting) that optimal transport 
plans are always c-cyclically monotone. In fact it can be shown that in the case 
of a Borel measurable cost function c : X x Y ^ [0,oo] every optimal transport 
plan is c-cyclically monotone, and that the two notions are equivalent if c is finitely 
valued ([BGMS08, Theorem 1]). (This equivalence is not true in general, as shown 
by a beautiful counterexample due to Ambrosio and Pratelli, |AP03[ Example 3.5]. 
The connection between optimality and c-cyclical monotonicity was also studied in 
ICtMQBI lAPOSl IPraOSI ISTHS] . ) 

The concept of c-cyclical monotonicity is crucial for the Monge-Kantorovich dual- 
ity. We shall review its connection with strong c-cyclical monotonicity. As indicated 
by the name, it is almost obvious to see that strong c-cyclical monotonicity implies 
c-cyclical monotonicity: Assume that (f>, ^ witness that tt is strongly c-cyclically 
monotone. Then vr is concentrated on the set F — {{x,y) : c{x,y) = (j){x) + "0(2/)} 
and 

n 

(16) ^c{x,+i,yi)-c{xi,yi) = 

i=l 

n 

(17) ^c(x,+i,2/i) - [(j>{x^) + ipiyi)] > 

i=\ 

n 

(18) Y. W^+^) + V'(y.)] - Wi) + i'ivi)] = 

i=l 

for all (11,2/1), . . . , {xn,yn) G r. Less trivially, the subsequent proposition asserts 
that the strong version can be deduced from the usual one, provided that c is finitely 
valued. (This is in general not the case if c attains cx) on a large set, see Example 
lU below.) 

Proposition 2.1. Assume that X,Y are Polish spaces equipped with Borel prob- 
ability measures fi, v, that c : X x Y —> [Q, cxd] is Borel measurable and /i ® v-a.e. 
finite and that there exists a finite transport plan. Then every c-cyclically monotone 
transport plan tt is strongly c-cyclically monotone. 

Proposition 12.11 can be proved using a well known construction given in [Riis96| . 
see also |Roc66| . |Vil03[ Chapter 2] and [AP03| Theorem 3.2]. Assume for notational 



■^Throughout this paper we use the convention 00 — 00 = 00. 
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convenience that n is concentrated on a c-cyclically monotone set F which satisfies 
px[r] — X and pyp] — Y and that c is finite on X x Y. Then the definition 

(19) (t){x) :=inf< ^ [c{xt+i,yi)-c{xi,yi)] : {xi,yi), ..., (a;„,y„) G T,Xn+i = x \ 

(20) ^{y):=mlc{x,y)-^x), 

where xi € X is an arbitrary fixed point, yields functions witnessing that n is 
strongly c-cyclically monotone. Strictly speaking, it might be necessary to alter 
and tp on appropriately chosen null sets to ensure that they are Borel measurable 
functions, but these are merely technical obstacles which we will not discuss at 
this point. Instead we shall below derive Proposition 12 . 1 1 rigorouslv from the more 
general result in Proposition 13. 61 

The subsequent statement summarizes how c-cyclical monotonicity connects to 
the other concepts discussed so far. 

Proposition 2.2. Assume that X, Y are Polish spaces equipped with Borel prob- 
ability measures /x, v, that c : X x Y -^ [0, oo] is Borel measurable and fj, (8) v-a.e. 
finite and that there exists a finite transport plan. Let 0, ijj be dual maximizers in 
the sense of (RP. Then the set F := {(x, y) : 4){x) + ipiy) — c{x,y)} is c-cyclically 
monotone. 

For any finite transport plan n the following conditions are equivalent. 

(a) TT is concentrated on F. 

(b) n is c-cyclically monotone. 

(c) TT is strongly c-cyclically monotone. 

(d) TT is optimal. 



Proof. To see that F is c-cyclically monotone, argue as in (|T6|) . As regards the 
equivalence of (a) to (d), (a) trivially implies (b). We have seen above that (b) and 
(c) are equivalent and (c) and (d) are equivalent by Corollarv ll.2l 

If TT is an optimal transport plan, (/), tp witness that tt is strongly c-cyclically 
monotone by CoroUarv 11.21 (2) such that 0(x) -I- ^(y) = c(a;, y) holds for 7r-almost 
all {x,y) & X xY. Hence (d) imphes (a). D 

3. Transports with subsidies 

In the last section it was described how dual maximizers can be constructed 
starting from a c-cyclically monotone transport plan. However, in the absence of 
lower semi-continuity of c, there is no reason why one should find a transport plan 
which is supported by a c-cyclically monotone set, even in very regular situations 
as is shown by the subsequent easy example. 

Example 3.1. Let {X,ii) ~ {Y,iy) equal the unit interval [0,1] equipped with 
Lebesgue measure. Define the cost of moving x to y by c(a;, y) = (x — y)^ for x ^ y 
and let c{x, x) = 1. Clearly it is possible find transport plans with arbitrarily small 
costs, but the infimum is not attained. Thus there exists no optimal and hence 
no c-cyclically monotone transport plan. 

Observe that dual maximizers exist; just set = V = 0- 

Our attempt to overcome this difficulty is to introduce a certain subsidy function. 
To explain this notion we take up the anecdotal interpretation (see |Vil05| Chapter 



DUALITY FOR BOREL MEASURABLE COST FUNCTIONS 7 

3]) where {X^ji) models the Parisien bakeries (i.e., croissant sellers) and {Y, v) the 
Parisian cafes (i.e., croissant buyers) and c{x,y) denotes the transport cost from 
bakery x to cafe y. To avoid technicalities we suppose that there are only finitely 
many cafes and bakeries in Paris and that c(x, y) is finitely valued. (We are not 
sure to which degree this assumption corresponds to reality.) Suppose that n is 
the traditional way how the croissants are transported from the bakeries to the 
cafes and that, for whatever reason, the Parisian authorities want to maintain this 
transport way also in the future. If the difference a between the present costs Ic[t^] 
and the cheapest possible transport costs Ic is strictly positive and the authorities 
do not intervene, they should expect that market forces will sooner or later cause 
the transport scheme to switch from tt to some other tt with lower total transport 
cost. Therefore they might try to introduce a subsidy system, where the transport 
of each croissant from x to y is subsidized by an amount f{x, y) G [0, cxd). 

The aim of the Parisian authorities is to design the system /(x, y) of subsidies in 
such a way that the daily total subsidies F = Jxxy ■I' '^^ effectively paid are mini- 
mized under the constraint that the bakers and cafetiers have no rational incentive 
to change the traditional transport system tt by cyclically rerouting their ways of 
transportation. 

A moment's reflection reveals that a lower bound for the cost of subsidy is given 

by 

(LB) F^ [ f diT > a = Ic[tt] - Ic, 

and it will turn out that this lower bound is attained which should not be very 
surprising. In fact, there are (at least) two versions of the "no incentive to change" 
constraint: 

n 

(Wl) ^c{xi+i,yi) - {c{xi,yi) - f{x^,yi)) > 

i=l 

for all (xi, yi), . . . , (xm Vn) in the support of n or 

n 

(^1) X! (c(2;»+i,yi) - f{xi+i,yi)) - {c{x^,y^) - !{x^,yi)) > 

for all (xi, yi), . . . , (x„, y„) in the support of n. 

An interpretation of the two requirements goes as follows. In (|W1|) the author- 
ities make a "take it or leave it" proposal to the bakers and cafetiers: if you stick 
to the transport system tt we pay the subsidies /, if not we pay nothing. Hence 
comparing the transport costs from Xi to yi with the ones from Xi+i to yi, we have 
to make sure that for every collection (xi, j/i), . . . , {xn,yn) in the support of tt the 
subsidized costs J2^=i c{xi,yi) — f{xi, yi) are less than or equal to the non-subsidized 
costs after rerouting X]r=i ^(s^i+i, J/i), which amounts to (IWip . 

In the interpretation of ()Sip the Parisian authorities behave in a less authoritar- 
ian way: they promise to pay the subsidies f{x,y) independently of whether the 
bakers and cafetiers are obedient or not, which amounts to the constraint (jSip . 

In fact, and this seems somewhat surprising, the validity of (IW1[) implies that 
there exists a function / such that F = /^^y/d7r = a and such that the subsequent 
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constraint (|S2|) which is yet stronger than (jSTj) is satisfied: 

n 

(S2) ^ {c{xi+i,yi) - f{xi+i,yi)) - {c{x^,y^) - f{x^,y^)) > 



4=1 



for all (xi, yi), . . . , (x„, ?;„) G X X y (not necessarily being in the support of tt). 

Replacing (xi, yi), . . . , (x„,2/„) by (a;„, y„_i), (a;„_i,y„_2), • • • , (a;2, yi), (a;i, j/„) 
one verifies that one must have equality in (|S2[) for all (a;i, j/i), . . . , (x„, ?/„) G X xY. 
This may be interpreted as follows: there is a subsidy function f{x,y) with total 
subsidy payment F — ^^^y f di: ^ a and such that, for the subsidized transport 
cost we have that /xxy(''~ /) "^^ ^^ equal for any transport plan tt G n(/i, v) and if 
(jLBp holds true, one easily verifies that this value must equal Ic- In particular, the 
bakers and cafetiers have no incentive to change tt as they are, in fact, indifferent 
between all the possible transports tt G n(M, v)^ if their goal is to minimize the total 
subsidized transport costs. 

In order to prove the existence of a subsidy system f{x,y) satisfying (|S2[) and 
(jLBp we use the following constraint which strengthens (jWip in a similar way as 
dSU strengthens fSTj) : 



(W2) ^c{xi+i,yi) - {c{xi,yi) - f{x„yi)) >0 

for all (xi, yi), . . . , (xn, y„) G X X F. 

We refrain from giving an intuitive interpretation of (|W2p . Rather we try to 
indicate on an intuitive level why all four versions of the constraint are equivalent 
when minimizing F = Jxxy ^ '^^' More precisely, suppose that there is a function 
f{x,y) such that Jx^y I dii — a satisfying (the weakest form of) constraint (jWip 
and let us show that there exists a function / with J f dfr — /c[7r] — Ic for any 
transport plan tt. Hence / satisfies the (strongest form of) constraint (|S2p . 

To pass from (jWip to (|W2p is a cheap shot: observe that (jWip as well as 
fy -^ / dTT only pertain to values of / on the support of tt. Hence we may alter / 
outside the support of tt to be oo (in the case of finite X and Y we clearly may 
replace cx) by a sufficiently large real number), which then trivially satisfies (IW2I) . 
To alleviate notation we still denote by / the function satisfying (jW2p . To pass 
from (|W2p to (jS2p we observe the subsequent "sandwich" type result which seems 
interesting in its own right. 

Denoting c{x,y) = c(x,y) and c{x,y) := c(x,y) — f(x,y) inequality (IW2p may 
be written as 

n 

(W3) y^^c{xi+i,yi) ~ c{xi,yi) > 

1=1 

for all {xi,yi), ..., {xn, yn) E X xY. 

We shall show that (|W3p implies (at least in our present setting of finite spaces 
X and Y and a finite cost function c = c) that we may find functions (j){x),'ijj(y) 
such that 

(W3a) c{x,y)>(f>{x)+'ip{y)>c{x,y), 

for all {x,y) G X. 
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To motivate why this resuh should indeed be considered as a sandwhich theorem 
note the easy fact that a function c(x, y) on X x Y may be written as 

(21) c{x,y)^cj){x)+i:{y) 
if and only if it satisfies 

n 

(22) ^c{x,+i,yi)-c{xi,yi) = 

for all (xi, yi), . . . , (x„, y„) e X x F. The problem under which conditions a func- 
tion defined on the product of two sets can be decomposed as the sum of two 
univariate functions is studied in detail in [BL92] . 

A precise version of our sandwich theorem under the assumption that li^c are 
Borel measurable and c is /^(g)i^-a.e. finite is given in Proposition l3.6l In our present 
situation it guarantees the existence of functions (/), ^ satisfying 

(W3b) c{x, y) > c^{x) + ^{y) > c{x, y) - f{x, y). 

for all (x, y) ^ X y.Y . This allows us to define 

(23) />,y):=c(a;,y)-(0(x)+V(y))- 

The lower bound of (|W3bp implies that /(x, y) > c{x, y)— {(f){x)+^j{y)) = f{x, y), in 
particular the desired bound for the total cost of subsidy Jxxy f '^^ — Ixxy -f ^^ ~ 
a holds true. The subsidized cost function (c — /) is of the form 

(24) ic-f)ix,y)=^ix)+^{y) 



and hence satisfies the (strongest form of) constraint (|S2p . 

We have observed above that (jS2p implies that Ic = /xxy ('^^ ■^) '^^ ^°'" ^^^ finite 
transport plan tt S Il{^,v). Thus /xxy('^~ /) '^'^ — /xxy'^(^) + ^iu) dTrix^y) — 
J{(j),'ijj) yields that 0, ■0 are dual optimizers. 

Finally, note that as a consequence of the fact that the total costs Jxxy f '^^ ^^ 
our subsidy system cannot be less than a together with the point-wise inequality 
f > f implies that / and / coincide on the support of tt. 

3.1. Existence of subsidy functions. After the previous heuristic arguments we 
now pass to a more rigorous analysis. In order to find dual maximizers, we will first 
prove that there exists a subsidy function / which satisfies (jS2[) for a given finite 
transport plan tt. Note that in the subsequent Proposition we do not assume that 
c is fj,(E) ly- a.. e. finitely valued. 

Proposition 3.2. Let X, Y be Polish spaces equipped with Borel probability mea- 
sures II, V. Let c : X X Y —t [0, oo] be Borel measurable, assume that n is a 
finite transport plan and set a ~ Ic[t^] ^ ^c > 0. Then there exists a function 
f : X xY —f [0, c»] such that J f di: ~ a and, for all {xi, yi), . . . , {xn, yn) (^ X x Y , 

n 

(25) ^c{xi+i,yi) + f{xi,yi) - c{xi,y,) > oQ 



We prefer to write +f{xi,yi) — c{xi,yi) rather then —{c{xi,yi) — f(xi,yi)) in view of our 
convention oo — oo = oo. 
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The main ingredient in the proof Proposition l3.2l is the following duality theorem 
due to Kellerer (see |Kel84| Lemma 1.8(a), Corollary 2.18] and |Kel85| p 212]). 

Theorem 3 (Kellerer). Let Zi, . . . , Z„,n > 2 be Polish spaces equipped with Borel 
probability measures tti, . . . , 7r„ and assume that C : Z = Zi x . . . x Z„ —^ [— oo, oo) 
is Borel measurable and that b — sup^ C is finite. Set 

(26) /c:=inf| /" C d^ : k e n(7ri, . . . , 7r„)| , 

{n „ n \ 

"^ 4>i dn^ : C{zi , . . . , 2„) > ^ 0^ (z^) L 
i = l •'^i i = l J 

where 0i, . . . , (/)„ are Borel functions taking values in [— oo, oo]. 
Then Ic = Sc- 

We will use it in the following form: 

Corollary 3.3. Let Z be a Polish space equipped with a Borel probability measure tt. 
Let e : Z" — > [0, oo] be a Borel measurable function such that e(zi, Z2, . . . , Zn-i, Zn) = 
e{z2, Z3, . . . , z„, zi) for all zi, . . . , z„ G Z and let a > 0. Assume that 



(28) 



sup < / edn : K £ n(7r, . . . , tt) > < na. 



Then, for S > 0, there exists a function / : Z ^ [0, 00] such that e(zi, . . . , z„) < 
f{zi) + ... + f{zn) and J fdiT <a + S. 

Proof. Applying Kellerer's Theorem to the function C = — e, we find functions 
/i, ...,/„: Z ^ [0,00] such that e(zi, . . . , z„) < /i(zi) + . . . + /„(z„) and / /i d7r + 
... + Jfnd7:<n{a + 6). Set f{z) = (/i(2) + . . . + fniz))/n. Then 



(29) e(zi, . . . , z„) = - V e(cr'=(zi, . . . , z„)) 

n ■^ — ' 

k=0 
-. n — 1 

(30) <- V/i(2i+fc) + ... + /„(z„+fe) 

n -"^ — ' 

k=0 
-. n — 1 n — 1 

(31) - - ^ /i(zi+fe) + . . . + /„(zi+fe) - Yl /(^1+fc)' 



fc=0 fe=o 

where cr(ri, . . . ,r„) = (r2, . . . , r„+i). D 

Lemma 3.4. Let X,Y be Polish spaces equipped with Borel probability measures 
fj-jiy, let c : X X Y —f [0, 00] be Borel measurable and assume that n is a finite 
transport plan satisfying IcIt^] < Lc + a. Set 

n 

(32) e{xi,yi,...Xr,,yn) -^ i^Yc{xi+i,yi)-c{xt,yi)j > 0. 

i=l 

Then 

(33) sup < / edn : K ^ n(7r, . . . , tt) > < na. 

[■/(JfxY)" I 
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Proof. Denote hy a,T : {X x F)" ^ {X x F)" the mappings 

(34) a- : {x„yi)'^^^ ^ {x^+l,y^+l)1^l 

(35) T : {xi,yi)'^^^^ {xz,yt+i)"=i- 

Observe that u" = r" = Id(xxy)" a-nd that a and r commute. By pi : (X x 
Y)" ^ X X Y, {xi, yi, . . . , Xn, yn) '-^ (xi, yi) we denote the projection on the i-th 
component of the product, while the projections px '■ X xY -^ X^{x^y) i-^ x and 
Py '■ X xY ^Y,{x,y) i^ y are defined as above. 
Pick K G n(7r, . . . , tt). By replacing k with 

(36) -(K + cr#K+... + (cr)"-l#K), 

we may assume that k is cr-invariant. Set -B = {e > 0} and consider the restriction 
of K to i? defined by fi{A) — n{A\^B) for Borel sets A <Z [X x y)". k is cr-invariant 
since both the measure k and the set B are cr-invariant. Denote the marginal of k 
in the first coordinate {X x Y) of {X x F)" by tt. Due to cr-invariance we have 

(37) Pi#k ^ p,^{a#k) 

(38) = (p, ocr)^K = Pj+i#k, 

i.e. all marginals coincide and we have k G n(7r, . . . , tt). Furthermore, since k < k, 
the same is true for the marginals, i.e. tt < tt. Denote the marginal pi^{T^k) of 
T^k in the first coordinate {X x Y) of {X x F)" by Up. As a and r commute, t^k 
is cr-invariant, so the marginals in the other coordinates coincide with tt^. Note 
that 7r^(X xY) = k{^{X x F)") = 7r(X x Y). Moreover tt and irp have the same 
marginals in X resp. Y . Indeed, let C C X, Z? C F be Borel sets. Then 

(39) pjf#7r^(C) = T#k{(px opi oTy^[C]) == 

(40) ^T^k{{xi,...,yn) : xi e C} = 

(41) = k{{xi, ... ,yn) : xi e C} ^ px#tt{C) 

(42) pYH^T^piD) = T^k{{pY opi oTy^[D]) = 

(43) =T#K{(a;i,...,y„) : j/i e £>} = 

(44) ^ k{{xi, . . . ,yn) ■■y2^D}=pY#^{D). 

This enables us to define an improved transport plan by 

(45) 7r/5 = (tt-tt) -f 7r/3. 

Since tt < tt, we have that (tt — tt) is a positive measure, hence (j45[) defines a positive 
measure as well. Since tt and tt^ have the same total mass, irp is a probability 
measure. Furthermore tt and iTp have the same marginals in X, resp. F, so np is 
indeed a transport plan. It remains to apply the assumption that the transport 
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costs of 7r/3 cannot be cheaper by more than a than the ones of ir: 

(46) a>/c[7r]-/cM 

(47) = / cd{TT-TTf:s) 

JxxY 
1 " /■ 

(48) =~y^/ cop^d{k-T^k) 



1 " /■ 

(50) ^-V"/ c{xi,yi)~c{xi+i,yi)dK{xi,...yn) 

(51) = - / e(a;i,. ..,?/„) d^ 

nJB 

(52) = - / e(a;i,. ..,yn)dK. 

'^ J(XxY)" 

Since k £ n(7r, . . . , tt) was arbitrary, this yields ([55)1 . D 

In the proof of Proposition 13. 21 we shaU apply a result of Komlos f |Kom67j ). 

Lemma 3.5. Let (/„),i>i be a sequence of measurable [O,oo]-valued functions on 
a probability space {Z,n) such that sup„>j^ ll/nlli < oo. Then there exists a subse- 
quence (/„)„>! such that the functions 

(53) -(/i + ... + /«), n>l 

n 

converge 'K-a.e. to a function taking values in [0,oo]. 

In particular, there exist functions gn G conv(/„, /„_|_i, . . . ) such that (.g„)„>i 
converges 'K-a.e.. 



Proof. The first part of Lemma 13.51 is Komlos' original result which we will not 
prove. The assertion that there exist gn € conv(/„, /„+i, . . . ) such that {gn)n>i 
converges almost everywherqj is a simple consequence which we will derive for the 
sake of completeness. Assume that ^ (/i + . . . + /„) converges 7r-a.e. to a function 
g. Since all functions /„, n > 1 are 7r-a.e. finitely valued, for each fc > 1 there exists 
some rik such that 



(54) -[[nS^^^-- + f^)>i:\)<W 

Setgk^^{fk+i + ... + fn,+k)- a 

Proof of Provosition \3.2l Combining Lemma [?^ and Corollarv l3.3l we achieve that 
for each n > 2 there exists a function /„ such that J fndir < a + 1/n and for all 

(xi,2/i),...,(a;„,2;„) G X x y 

n 

(55) ^c{xi+i,yi) + fn{xi,yi) - c{xi,yi) > 0. 
1=1 



^In fact, this result holds true without any integrability assumptions, see | DS94I Lemma Al.l]. 
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Observe that for all p > 1 the function /„p satisfies (f55| as well since we can run 
through the cycle 

(56) {xi,yi) ^ ... ^ {xn,yn) -^ {xi,yi) ^ ... 

p times. Also note that any convex combination and any pointwisc limit of functions 
which satisfy ([55|) for some fixed n satisfies ([55l) (for the same n) as well. Thus we 
may apply Lemma [531 to find functions <?„ G conv(/„!, /(„+i)!, . . . ) which converge 
TT-almost everywhere. Defining g as the pointwise limit where this limit exists and 
oo elsewhere, yields a function which satisfies (j55p for every n. Moreover 



(57) I g diT = I lim inf (;„ (ivr < liminf / gndn = a, 

J J n — >oo n — >oc / 

hence the desired bound for the total costs of subsidy holds true. D 

3.2. Dual maiximizers by subsidized transport plans. The main goal of this 
section is to prove the sandwich-type result announced above. 

Proposition 3.6. Assume that X, Y are Polish spaces equipped with Borel proba- 
bility measures /i, v, that c : X x Y ^ (^ oo, oo] is Borel measurable and jjl ® v-a.e. 
finite and that c'.XxY—t [— oo, oo) is Borel measurable. If 

n 

(58) ^c{xi+i,yi)-c{xi,yi)>0 

i=l 

for all xi, . . . ,Xn £ X , j/i, . . . , y„ S Y , there exist Borel measurable functions 
(J) : X ^ [-00,00), -0 : Y -^ [-00,00) and Borel sets X' C X,Y' C Y of full 
measure such that 

(59) c{x,y) <(t){x)+i;{y) <c{x,y), 

where the lower bound holds for x £ X',y £ Y' and the upper bounded is valid for 
all X e X,y £ Y. 

Observe that Proposition 13.61 is in fact a generalization of Proposition 12.11 Let 
TT S n(^, ly) be a finite transport plan which is concentrated on a c-cyclically mono- 
tone Borel set F. Without loss of generality assume that c is finite on F. Set c = c 
and 



Then c, c satisfy the assumptions of Proposition 13.61 and thus there exist Borel 
measurable functions 0, ip which satisfy 4>{x) + ijj{y) < c(x, y) for all x € X,y € Y 
and 4'{x) + ip{y) > c{x,y) for 7r-almost all {x,y) (z X x Y , hence n is strongly 
c-cyclically monotone. 



Before giving the proof of Proposition 13.61 we need some preliminaries, in par- 
ticular we will recapitulate some facts from the theory of analytic sets. They will 
be needed to deal with certain measurability issues which arise in the course of the 
proof. 

Let X be a Polish space. A set yl C X is analytic if there exist a Polish space Z, 
a Borel measurable function f : Z ^ X and a Borel set B CY such that f{B) = A. 
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Lemma 3.7. Let X, Z be Polish spaces and g : X x Z ^ [—00, 00] a Borel mea- 
surable function. Set 

(61) (j){x) = inf g{x,z). 

Then {(f) < a} is analytic for every a € [00, 00]. 
Proof. We have 

(62) (j){x) < a <(=> 3z,g{x,z) < a. 

Thus {(f><a}=px [{g{x, z) <a}]. D 

Given a Borel measure /x on X, we denote its completion by jl. By a result 
of Luzin (see for instance |Kec95[ Theorem 21.10]) every analytic set A C X is 
the disjoint union of a Borel set and a /i-nuU-set. This will allow us to replace a 
function which only satisfies that the sets {</> < a}, a £ [—00, cxj] are analytic by a 
Borel measurable function. 

Lemma 3.8. Let X be a Polish space and 11 a finite Borel measure on X. If 
(j) : X ^ [—00,00] satisfies that {<f>{x) < a} is analytic for each a € [—00,00], 
then there exists a Borel measurable function (j) : X -^ [—00,00] such that 4> < 4> 
everywhere and (j) — (j) almost everywhere with respect to jl. 

Proof. Let (/n)^i be an enumeration of the intervals [—00, a), a £ Q. Then for 
each n g N, 0~^[/„] is /i-measurable and hence the union of a Borel set i?„ and 
a ji-miW set Nn. Let A^ be a Borel null set which covers IJ^i-^"- ^'^^ i'ix) — 
4i{x) — 00 • \n{x). Clearly 4'{x) < (j){x) for all x ^ X and (j){x) — 4'{x) for /i-almost 
all x ii X. Furthermore, (/> is Borel measurable since (/n)S^i is a generator of the 
Borel CT-algebra on [—00, 00) and for each n S N we have that 0^^[/„] = i3„ \ A^ is 
a Borel set. D 



Lemma 3.9. Assume that in the setting of Provosition [STm we have that 

(63) il{px [{c > -00} \{XxN)])^l 

for every ly -null-set N CY . Then there exist aii G X and a Borel set X' C X with 
fi{X') ~ 1 such that for each x Cz X' there are ?/i, y G Y satisfying 

(64) c(a;i,j/i) > -00, c{x,yi) < 00, 

(65) c(x,y)>— 00, c(xi,y)<oo. 

Proof Set 

(66) Xi = {x: ,y{{y : c(x, y) < 00}) = 1} 

(67) Yi^{y: tii{x:cix,y)< 00})^ 1}. 

By Fubini's Theorem, fJ,{Xi) = i^{Yi) = 1. Since fi{px[{c > -00} n (X x Yi)]) = 1, 
there exist xi G Xi,yi G Yi such that c{xi,yi) > —00. Since xi G Xi, 

(68) r':={j/Gri:c(xi,y)<(^} 
has j/-measure one. Consequently 

(69) X' ■.= {x:c{x,yi) < oo}npx[{c> -00} n (X x Y')] 



has full /i-measure. It remains to check that the assertions of Lemma 13.91 are 
satisfied. Choose x G X'. By definition of X', c{x,yi) < 00. Since x G px[{c > 
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—00} n (X X Y')], there exists some y &Y' such that c{x,y) > — oo. Since y G Y', 
c(xi,y) < 00. D 

Proof of Proposition \3.b\ Note that it is sufficient to define </) and -0 on Borel sets 
X' Q X,Y' CY with /i(X') = v{Y') = 1, since they can then be extended to X 
and Y by setting them — oo on the null-sets X\X',Y\Y' . This will be used several 
times in the course of the proof. 

Next we will show that it is sufficient to consider the case that fi{px[{c, > — oo}\ 
{X X N)]) = 1 for every i^- null-set N CY, such that Lemma [331 is applicable. Set 

(70) P := ini{fi{px[{c > -oo} \{Xx N)]) : N CY, iy{N) = 0}. 
Choose for each k > 1 a. t^-nuU-set Nk C Y such that 

(71) P+l>KPx[{c>-(x}\iXxNk)]) 

Set TV := IJ^i Nk,Y' = Y \ N and let M C px[{c > -oo} \ {X x N)] be a 
Borel set with ^(Af) = /?. Then fi{M n px[{c > -oo} \ {X x N')]) = /3 for 
every i/-null-set N' C Y' and it is sufficient to define </),?/; on M and F'; as above 
they can then be extended to X and Y by setting them — oo on X \ M resp. N. 
Thus we may assume without loss of generality that M — X or, equivalently, that 
Pjs:[{c > — oo} \ (X X N)] has full measure for every :/-null-set N C Y' . 
Choose xi and X' according to Lemma [331 and set 

(72) 0,i(a;) = inf i '^c{xi+i,yi) - c{xi,yi) : a;„+i = a; > 

(73) and (l>{x) — inf 4>n{x). 

n>l 

Fix a: £ X'. To see that — oo < (j){x) < oo, pick y, j/i £ Y' according to Lemma 
13.91 such that c{xi,yi) > —oo,c{x,yi) < oo, c(a:, y) > —oo,c{xi, y) < oo. Then 

(74) 0(x) < 01 (x) <c{x,yi) -c{xi,yi) < oo. 

To prove the lower bound, we pick n > 1 and X2, ■ . . ,Xn G X' , yi, ■ ■ ■ ,yn G ^' and 
set Xn+i = a:,y„+i = 2/,x„+2 = a;i. By ([58l), 

n+l 

(75) ^c(a;i+i,2/i) -c(xi,?;i) > 

j=i 

n 

(76) whence y2c{xi+i,yi) - c{xi,yi) > -[c{xn+2,yn+i) - c(x„+i, j/„+i)] . 

^=l ' « ' 

Taking the infimum over all possible choices of rt > 1 and X2, ■ ■ ■ ,Xn G X' , j/i , . . . , y„ G 
Y' , we achieve that (f>{x) > — [c(xi, y) — c{x, y)] > — oo. 

Next observe that for x, x' G X', y E Y' and n > 1 

(77) 0„+i(x) < inf <^ '^c{xi+i,yi) - c{xi,yi) : Xn+2 = x,Xn+i = x',yn+i = y > 

(78) =0„(x')+[c(x,y)-c(a:',y)] 
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Taking the infimum over n > 1 yields that 

(79) (/.(x) < inf (t>n+i{x) < 0(x') + [c{x,y)-c{x\y)] 

(80) =^ c{x\y)-cb{x')<c{x,y)-cj}{x). 

At this point we wiU take care about measurablHty of 0. First apply Lemma [3771 
to the spaces X' and Z = lj^i(^' ^ ^)" to see that {(j) < a} is analytic for each 
a G [— oo, oo]. Then we may shrink X' a little bit to achieve, by Lemma l3.8l that 
is even Borel measurable. Note that ([75]) is then still vahd for all x, x' € X', y G Y' . 

By Fubini's Theorem, for :^-almost all y G Y' there exists some x G X such that 
c{x,y) < oo. By shrinking Y' a little bit if necessary, we may assume that this is 
the case for all y G Y' . Then the function 

(81) '0(y) := inf c(a;,y) -(/)(x) 

is finitely valued on Y' . As above, we apply Lemma 13.71 and Lemma 13.81 and shrink 
Y' a little further to achieve that tp is Borel measurable. Moreover 

(82) c{x,y)<^{x)+^P{y)<c{x,y) 

holds for all xeX',yeY'hy^£ind^. D 

Proof of Theorem\^ Pick an arbitrary finite transport plan tt S n(/i, v). Choose a 
subsidy function f : X xY ^r [0, oo] according to Proposition 13.21 such J f dn < 
Ic[7t] - Ic- Set c{x, y) = c{x, y) and 

c{x,y)- f{x,y) ifc(a;,y)^oo 



^ ' Qx^tV) ■ 1 _QQ ifc(a;,y) = oo ' 

in particular c = c — f holds 7r-a.e. since c is 7r-a.e. finite. For all a;i, . . . , a;„ e A" 
and yi,...,yneY, 

n 

(84) ^c{xi+i,yi)-c{x„yi)> 

i=l 
n 

(85) ^c{xi+i,y,) + f{x,,yi) -c{xi,y{) > 0, 

i=l 

Thus by Proposition l3.61 there exist functions (j) : X ^ [—oo, oo), ip :Y —> [— oo, oo) 
such that (l){x) + ip{y) < c{x,y) = c{x,y) for all x E X,y £ Y and (j>{x) + '(p{y) > 
c{x,y) = c{x,y) — f{x,y) for 7r-almost all {x,y) £ X xY. This implies 



(86) /c>^(0»> / c{x,y) - f{x,y) dnix,y) > I^t:] - [Un] ~ I,) ^ I„ 

thus Ic — J{4>, "0) s-iid hence 0, tp are dual maximizers. D 

To conclude the proof of Theorem [1] we show Lemma 11.11 
Proof of Lemma \1.1[ Similarly as in JST08] we define 

4>nix) = (-n V 4){x)) A n, 
V'n(y) = (-"-V ■0(2/)) An, 

^n{x,y) = 4>n{x) +1pn{y), 

^{x,y) = 4>{x) + 0(2/), 
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for {x,y) £ X X Y and n > 1. Observe that £,n ^ ^ on {^ > 0} and ^„ i ■? on 
{£. < 0}, as n -^ oo. Moreover f „ < c so that (0„, i/'n) € $(^, j^). 

AdditionaUy, J^dn < JcdTr, J^dTr < J cdn exist, taking possibly the value 
— oo, since ^ < c holds tt- almost surely as well as tt- almost surely and we assume 
that J cdiTjJcdTr < oo. By the assumption on equal marginals of tt and tt we 
obtain 



(87) I Cnd-K = I (j)„ dn + ipn d-K 

ndiT + / tpn dn 



(89) = J Cn d^, 
for n > 0, hence 

(90) / 6il{c>o} + '?ril{5<o} d-K = I C,il{5>o} + 6il{c<o} di:. 

By our previous considerations we can pass to the limits and obtain J ^ dn — J ^ dn. 
Indeed the limits are monotone on {^ > 0} and on {^ < 0} and the convergence is 
dominated by c on {^ > 0}. Hence the limits oi J ^^ndn — J ^„ dn exist as n — > oo 
and are equal. Consequently J ^dir — J ^ dix. D 

4. Examples 

We start with a simple example which shows that Monge-Kantorovich duality 
does not hold in general for a measurable cost function c : X x F — > [0, oo]. 

Example 4.1. Let X ~Y =[0,1]^ ^ = v the Lebesgue-measure and set 

{oo for X < y 
1 for X = y 
for X > y 

for {x,y) (z X X Y . The optimal (and in fact the only finite) transport plan n is 
concentrated on the diagonal and yields costs of one. Assume that (j), -0 : [0, 1] -^ 
[— oo, oo) are integrable functions satisfying (j){x)+ip{y) < c{x, y) for all x,y € [0, 1]. 
Then 

//• pi — a 

4)dii+ -ipdiy ^ hm / (t){x + a) + V'(a;) dx 

1*1— a 

(93) < lim / cix + a, x) dx = 0. 

Thus there is a duality gap, i.e., 

(94) l = Ic> sup{ J(0, V) : (0, ^) e $(m, i^)} = 0. 

Note also that Ic fails to be continuous in the sense of Corollary 11.31 For each 
n > 1, there exist transport plans assigning arbitrarily small costs to the function 
c /\n such that lim„_^oo Ic/\rL = ^ 1 = la- 
in |BGMS08l Example 5.3] a certain variant of Example 14.11 is considered. By 
setting c{x, y) = ^x — y for a; > y, the cost function becomes lower semi-continuous. 
In this case duality does hold true, but there are no dual maximizers, that is, the 
optimal transport plan tt is not strongly c-cyclically monotone. We present here yet 
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oo 


for X < y 


1 


for X = y 





for X > y 



another variant of Example 14.11 which shows that dual maximizers need not exist, 
even if the cost function is assumed to be continuous. 

Example 4.2. Let X = F = N U {uj} where we take cj to be a "number" larger 
than all n G N. Equip X and Y with the discrete topology and define ji = v such 
that positive measure is assigned to each point in X, resp. Y . Set 

(95) c{x,y)-- 

for (x, y) € X X Y . As above we find that the only finite transport plan tt is 
concentrated on the diagonal and yields costs of 1. Since X and Y are discrete 
spaces, c is continuous with respect to the product topology on X y.Y and hence 
duality holds true. Striving for a contradiction, we assume that there exist dual 
maximizers (/>, '0 : NU {w} — > [—00,00). Note that and V' are necessarily finitely 
valued since X and Y have no non trivial null-sets. Since 0, ij: witness that tt is 
strongly c-monotone we have 

(96) 4>{n) + -4}{n) = 1, 0(n + 1) + ^(n) < 0, and 0(w) + 0(n) < 
for each n G N. This yields 

(97) 0(n + 1) < 0(n) - 1, and 0(w) < 0(n) - 1 
for all n G N which is impossible for a finitely valued function. 



One can try to overcome the difficulties encountered in Examples 14.11 14-21 and 
[BGMSOSl Example 5.3] by admitting dual optimizers from a larger class of map- 
pings: Consider functions cj) : X ^ [— 00,00), 0; : Y —t [—00,00) which do not 
necessarily satisfy the inequality (j){x) + ip{y) < c{x,y) for all {x,y) & X xY but 
do satisfy the potentially weaker condition 

(98) 0(2;) + "ipiy) < c{x, y), TT-a.e. for every finite transport plan n G n(/i, i^). 

It follows then that 7(0,-0) = /xxy '^(^) ^ i^iu) dT^i^iU) ^ Ic[t^] for each finite 
transport plan tt G n(/x, j/), such that 

(99) J(0,0)< mf Ic[n]=Ic. 

Therefore it is reasonable to consider all pairs of functions (0, -0) satisfying ((98)) 
as admissible solutions of the dual part of the Monge-Kantorovich problem. In 
particular a transport plan tt is optimal provided that there exist measurable (not 
necessarily integrable) functions : X — > [— 00,00), -0 : Y — > [—00,00) satisfying 
and 



(100) / [<j)ix)+^j{y)]d7r{x,y)= cd^. 

JxxY JxxY 

Clearly, (|100p is tantamount to requiring that 4>{x) + ip{y) = c{x,y) for 7r-almost 
all {x,y). Observe that dual optimizers in this weak sense exist in Examples 14.11 
and 14.21 Since the only finite transport plan is the optimal one, we may just take 
= ^ = 1/2. However the subsequent construction (which is a variation of [AP031 
Example 3.5]) shows that, in general, dual optimizers do not even exist in this weak 
sense. 
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Example 4.3. Let X = Y = [0, 1], equipped with Lebesgue measure \ = ji = v. 
Pick a £ [0, 1) irrational. Set 

To = {{x,x) : X e X}, Ti = {{x,x®a) : x £ X}, 

where ® is addition modulo 1. Define c : X x y ^ [0, oo] by 

r for(a;,y)Gro,a;e[0,l/2] 

(101) cix,y)=\ ] for (x,y)e To,. G (1/2,1) 
^ ' ^ '^^ I 1 for (x,y) e Ti 

[ oo else 

Note that c is lower semi-continuous whence duality holds (cf. |Kel841 Theorem 
2.2]). For j = 0, 1, let ttj be the obvious transport plan concentrated on F^. Then 
all finite transport plans are given by convex combinations of the form qttq + (1 ~ 
gJTTi, g € [0, 1] and each of these transport plans leads to costs of 1. Assume that 
(p^Tp : [0, 1) — > [—00,00) are measurable functions which satisfy 

(1) J(<^,^) = 1, 

(2) (j){x) + i^{y) < c{x, y) for ttq- and TTi-almost all {x, y). 

This implies that, in fact, 4>{x) + ip{y) — c{x,y) for ttq- and 7ri-almost all (x,y). 
Thus 

(102) 0(x) + Vi(x) = | ^ forxeaAi) ' <^(^) + Vi(^®") = i 

nnQ^ \. h r^ \ / '/'(2^) + l for a; e [0, 1/2] 

103 whence 4>{x ® a) = \ ^ ' ^i/oi\ 

1^ W{x) — 1 for x e (1/2, 1) 

for all X G I, where / C [0, 1] is a Borel set of measure 1. By passing to a subset of 
full measure, we may additionally assume that /©a = /. Pick a set B C I such that 
X{B) > and sup^g^'0(a;) — mixeBipix) < 1. By a classic theorem of Steinhaus, 
B — B contains a non-empty open set. Since a is irrational, {(2n -I- l)a : n G N} is 
dense in (0, 1], thus there exists n G N such that x,x (B {2n + l)a S B. By (|103[) 
this implies that li^ix © (2n + l)a) — tp{x)\ > 1, contradicting the definition of B. 

We conclude with two examples which show that in general it is not possible 
to find dual maximizers (j) and ip which are integrable even if c is continuous and 
finitely valued. 

Example 4.4. Let X = Y ^ (0, 1], take n, v to be the Lebesgue measure and set 
c{x,y) = \l/x — 1/y + 1|. Define n to be the transport plan concentrated on the 
diagonal such that /c[7r] = 1. The functions cj){x) = 1/x, tpiy) = 1 — l/y witness that 
TT is strongly c-cyclically monotone and hence optimal. Recall that by Corollarv ll.2l 
functions that witness strong c-cyclical monotonicity of a transport plan correspond 
precisely to dual maximizers. In particular, (j) and ip are dual maximizers. Of course 
(p and tp are not integrable. Let (0, V') be another pair of dual maximizers. We will 
see that there is a constant /3 € R such that (j> = (j) + P,tp = ip — pi almost surely, 
hence there exist no integrable dual maximizers. 

Fix a,b £ (0, 1] such that a < b and (1/x — 1/y + 1) is positive on [a, 6]^. Let 
TTa.b be the transport plan which equals n on X x Y \[a,b]'^ and is l/(b — a) times 
the product measure on [a, 6]^. As above, and tp witness that iVa^b is strongly 
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c-cyclically monotone and thus optimal. But then also 4>, ip witness that TTa.h is 
strongly c-cyclically monotone, hence 

(104) (j}{x)+^{y)^c{x,y)^${x)+^{y) 
for almost all {x,y) G [a, 6]^- This yields 

(105) i<j>~$){x)^{i^-^){y), 

thus both sides are almost everywhere in [a, b] equal up to some constant Pa,b- Since 
we can cover (0, 1] with sufficiently small overlapping intervals [a„,6„],n > 1, we 
achieve that this constant Pa,b ='■ P does in fact not depend on the choice of a and 
b. It follows that indeed (f> — (p + f3 and ip = ip — f3 almost everywhere. 

In fact, one can find an example displaying the same phenomenon as Example 
14.41 where c is just the squared distance on M. 

Example 4.5. Let X — Y = M., let c{x, y) ^ {x — y)^, let /i be a Borel probabilty 
measure on X, define T : X —i- Y by T{x) = a; + 1 and assume that v — T^ji. 
Let TT G n(/i, ly) be the obvious transport plan concentrated on the graph T = 
{{x, x + 1) -.xem} oiT. Set (j){x) = -2a;, tP{y) = 2y-l. Then 

(106) cix, y) - 4>{x) - My) = (x - y)^ - 2{x - y) + I = {x - y - 1)^ 

is non negative for all {x,y) € X x Y and 4>{^) + '/'(?/) = c{x,y) holds precisely 
for {x,y) € r. Thus tt is strongly c-cyclically monotone and the corresponding 
transport costs J^IcItt = 1 are minimal. 

We claim that the dual optimizers (f>, "ip are essentially unique if the Lebesgue 
measure A is absolutely continuous with respect to //. 

Let 0, -0 be dual optimizers. Since J{4>, ip) — Jxxy ^i^) '^ '^{v) dn = I, we have 
<j){x) + ipiy) = c{x,y) = 1 for 7r-almost all {x,y) E X x Y. Thus there is a set 
X' CX with X{X \ X') = ^l{X \X') = such that 

(107) pix) + ?/;(l + x) = c{x, a; + 1) = 1 =^ V^(l + a:) = 1 - 0(a;) 

for aU X e X'. Since ^{x) + %p{l + y) < c{x, 1 + y) ^ {x - y - 1)^ for all x,y eR, 
(fT07|) yields that for allx & X,y & X' 

(108) Mx) + l--(b{y) <ix-y-l)^ 

(109) =^ {q>ix) + 2x)-{q,{y) + 2y) < [x - y - if - 2{x - y - I) + 1. 
Setting f{x) — (j>{x) + 2x, (|109p is tantamount to 

(110) fix)-f{y)<{x~yf 

for all X G X,y E X' . Thus / is constant on X', whence there exists a constant 
/? e M such that 

(111) cP{x) = -2x + f3 = Pix), ^P{y) = 2y-l + f3 = i,{y). 

for A- as well as /z-almost all a; G X and A- as well as z^- almost all y E Y. 

If we pick /i such that A is absolutely continuous with respect to /i and such that 
L X d^{x) does not exist, (j) and ip are not integrable and by (jllip no other pair of 
dual maximizers can be integrable either. 
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